The fourth-order cumulant of speech signals with application to voice activity detection
نویسندگان
چکیده
This paper explores the fourth order cumulants (FOC) of the LPC residual of speech signals and presents a new algorithm for Voice Activity detection (VAD) based on the newly established FOC properties. Analytical expressions for the horizontal slice of the 4th cumulant as well as the kurtosis of voiced speech are derived based on a reported sinusoidal model [4]. The derivations demonstrate that the kurtosis of voiced speech is distinct from that of Gaussian noise and can be used to aid in detecting voicing. The proposed VAD combines FOC metrics with SNR measures to classify speech and noise frames. Its performance is compared to the ITU-T G.729B VAD [1] in various noise conditions, and quantified using the probability of correct and false classifications. The results show the proposed VAD has overall comparable performance to the G.729B: Its probability of false classification is lower in low SNR and Gaussian-like noise, but higher in speech-like noises.
منابع مشابه
A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملImproved Voice Activity Detection in the Presence of Passing Vehicle Noise
Voice activity detection (VAD) is an important enabling technology for a variety of speech-based applications including speech recognition, speech encoding, and hands-free telephony. The primary function of a voice activity detector is to provide an indication of speech presence in order to facilitate speech processing as well as possibly provide delimiters for the beginning and end of a speech...
متن کاملA New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملOptimal combination of fourth-order cumulant based contrasts for blind separation of noncircular signals
In this paper, we have considered the problem of blind source separation of noncircular signals. We have proposed a new Jacobi-like algorithm that achieves optimization of the optimal combination of complex fourth-order cumulant based contrasts. We have investigated the application to separation of non-Gaussian sources using fourth order cumulants. And computer simulations applied to noncircula...
متن کاملSignal parameter estimation using fourth order statistics: multiplicative and additive noise environment
Parameter estimation of various multi-component stationary and non-stationary signals in multiplicative and additive noise is considered in this paper. It is demonstrated that the parameters of complex sinusoidal signal, complex frequency modulated (FM) sinusoidal signal and complex linear chirp signal in presence of additive and multiplicative noise can be estimated using a new definition of t...
متن کامل